Generic Finite Element Programming for Massively Parallel Flow Simulations
Abstract
We present design issues, data structures, and algorithms that allow generic finite element codes to leverage the power of high-performance computer clusters. Large-scale flow simulations require parallel algorithms that scale to thousands of processor cores. In generic finite element codes, simple parallelization often reduces to implementing parallel linear algebra and solvers, or to interfacing with an existing parallel linear algebra library. Other data structures and algorithms, e.g. mesh handling, are not designed for parallel computation and introduce poor scaling and memory constraints. As a result, scaling beyond about one hundred cores and a few million unknowns is not feasible. Today's computer clusters have up to tens of thousands of cores and are the foundation for dealing with large-scale numerical problems. Generic finite element codes such as deal.II feature methods for rapid and flexible development, higher-order elements, and adaptivity. To take advantage of those features on large parallel clusters, e.g. in flow simulations, many parts of the library have to be adapted and tailored to scale. We describe the steps taken to enable deal.II to scale from fewer than a hundred cores and a few million unknowns to thousands of cores and billions of unknowns: data structures for distributed, fully adaptive mesh handling; data structures for efficient indexing of degrees of freedom; algorithms for efficiently distributing the degrees of freedom; and more. Distributed storage of data and local, rather than global, communication play a key role in this process. The parallelization is done in a general setting applicable to any generic finite element library. We show numerical results that demonstrate the applicability of our design to generic finite element problems and that our algorithms scale very efficiently to billions of unknowns.
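To make the idea of range-based indexing of degrees of freedom concrete, here is a minimal sketch in Python. It is not the deal.II API and not the authors' implementation: the names `owned_ranges` and `RangeIndexSet` are hypothetical, and the sketch assumes each process owns one contiguous block of global indices and additionally stores a few ghost ranges. Storing ranges instead of individual indices keeps memory proportional to the number of ranges, not to the global problem size.

```python
from bisect import bisect_right
from itertools import accumulate


def owned_ranges(local_counts):
    """Give each rank a contiguous half-open block [start, end) of global indices.

    The starts are an exclusive prefix sum of the per-rank counts. In an MPI
    code this would typically be one scan operation; here the counts of all
    ranks are passed in as a list purely for illustration.
    """
    ends = list(accumulate(local_counts))
    starts = [0] + ends[:-1]
    return list(zip(starts, ends))


class RangeIndexSet:
    """Hypothetical index set stored as sorted, disjoint half-open ranges."""

    def __init__(self, ranges):
        # Sort the non-empty ranges and merge overlapping or adjacent ones.
        merged = []
        for begin, end in sorted(r for r in ranges if r[0] < r[1]):
            if merged and begin <= merged[-1][1]:
                merged[-1][1] = max(merged[-1][1], end)
            else:
                merged.append([begin, end])
        self._begins = [b for b, _ in merged]
        self._ends = [e for _, e in merged]
        # _offsets[i] = number of stored indices that precede range i.
        self._offsets = []
        total = 0
        for b, e in merged:
            self._offsets.append(total)
            total += e - b
        self._size = total

    def __len__(self):
        return self._size

    def _find(self, global_index):
        # Binary search for the last range starting at or before global_index.
        i = bisect_right(self._begins, global_index) - 1
        if i >= 0 and global_index < self._ends[i]:
            return i
        return -1

    def __contains__(self, global_index):
        return self._find(global_index) >= 0

    def local_index(self, global_index):
        """Map a stored global index to its position within this set."""
        i = self._find(global_index)
        if i < 0:
            raise KeyError(global_index)
        return self._offsets[i] + (global_index - self._begins[i])


# Three ranks owning 4, 3, and 5 degrees of freedom respectively.
ranges = owned_ranges([4, 3, 5])
# Rank 1 stores its owned block plus two assumed ghost ranges.
relevant = RangeIndexSet([ranges[1], (2, 4), (9, 10)])
```

In this sketch both the membership test and the global-to-local lookup cost one binary search over the stored ranges, so neither depends on the total number of unknowns.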
Similar resources
Massively Parallel Finite Element Programming
Today’s large finite element simulations require parallel algorithms to scale on clusters with thousands or tens of thousands of processor cores. We present data structures and algorithms to take advantage of the power of high performance computers in generic finite element codes. Existing generic finite element libraries often restrict the parallelization to parallel linear algebra routines. T...
Parallel Curved Mesh Adaptation for Large Scale High-Order Finite Element Simulations
This paper presents the development of a parallel adaptive mesh control procedure designed to operate with high-order finite element analysis packages to enable large scale automated simulations on massively parallel computers. The curved mesh adaptation procedure uses curved entity mesh modification operations. Applications of the curved mesh adaptation procedure have been developed to support...
Massively parallel adaptive reconstruction methods for real data based numerical simulations
In recent years, imaging techniques have improved considerably in many sectors, providing accurate numerical descriptions from 2D or 3D images, with applications in fields ranging from medical research to materials science. In this paper, a methodology to build a numerical description in mesh format has been implemented and is used for massively parallel finite element numerical simulations, d...
DUNE-FEM. A general purpose discretization toolbox for parallel and adaptive scientific computing
DUNE-FEM is a free discretization toolbox for parallel and adaptive scientific computing based on DUNE. The implementation of discretization schemes such as finite elements, finite volumes or discontinuous Galerkin schemes is based on abstractions that are very close to the mathematical description of the underlying methods. In this contribution we will give a compact overview on the design and...
Massively Parallel Computing: Unstructured Finite Element Simulations
Massively parallel computing holds the promise of extreme performance. Critical for achieving high performance is the ability to exploit locality of reference and effective management of the communication resources. This article describes two communication primitives and associated mapping strategies that have been used for several different unstructured, three-dimensional, finite element applicat...
Publication date: 2010